Requirement already satisfied: plotly in ./venv/lib/python3.10/site-packages (5.13.0)
Requirement already satisfied: tenacity>=6.2.0 in ./venv/lib/python3.10/site-packages (from plotly) (8.1.0)

[notice] A new release of pip available: 22.2.2 -> 23.0.1
[notice] To update, run: pip install --upgrade pip
Requirement already satisfied: folium in ./venv/lib/python3.10/site-packages (0.14.0)
Requirement already satisfied: numpy in ./venv/lib/python3.10/site-packages (from folium) (1.24.1)
Requirement already satisfied: branca>=0.6.0 in ./venv/lib/python3.10/site-packages (from folium) (0.6.0)
Requirement already satisfied: jinja2>=2.9 in ./venv/lib/python3.10/site-packages (from folium) (3.1.2)
Requirement already satisfied: requests in ./venv/lib/python3.10/site-packages (from folium) (2.28.2)
Requirement already satisfied: MarkupSafe>=2.0 in ./venv/lib/python3.10/site-packages (from jinja2>=2.9->folium) (2.1.2)
Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./venv/lib/python3.10/site-packages (from requests->folium) (1.26.14)
Requirement already satisfied: charset-normalizer<4,>=2 in ./venv/lib/python3.10/site-packages (from requests->folium) (3.0.1)
Requirement already satisfied: certifi>=2017.4.17 in ./venv/lib/python3.10/site-packages (from requests->folium) (2022.12.7)
Requirement already satisfied: idna<4,>=2.5 in ./venv/lib/python3.10/site-packages (from requests->folium) (3.4)

Requirement already satisfied: mlxtend in ./venv/lib/python3.10/site-packages (0.21.0)
Requirement already satisfied: matplotlib>=3.0.0 in ./venv/lib/python3.10/site-packages (from mlxtend) (3.6.3)
Requirement already satisfied: joblib>=0.13.2 in ./venv/lib/python3.10/site-packages (from mlxtend) (1.2.0)
Requirement already satisfied: scikit-learn>=1.0.2 in ./venv/lib/python3.10/site-packages (from mlxtend) (1.2.1)
Requirement already satisfied: pandas>=0.24.2 in ./venv/lib/python3.10/site-packages (from mlxtend) (1.5.3)
Requirement already satisfied: numpy>=1.16.2 in ./venv/lib/python3.10/site-packages (from mlxtend) (1.24.1)
Requirement already satisfied: scipy>=1.2.1 in ./venv/lib/python3.10/site-packages (from mlxtend) (1.10.0)
Requirement already satisfied: setuptools in ./venv/lib/python3.10/site-packages (from mlxtend) (63.2.0)
Requirement already satisfied: cycler>=0.10 in ./venv/lib/python3.10/site-packages (from matplotlib>=3.0.0->mlxtend) (0.11.0)
Requirement already satisfied: pillow>=6.2.0 in ./venv/lib/python3.10/site-packages (from matplotlib>=3.0.0->mlxtend) (9.4.0)
Requirement already satisfied: python-dateutil>=2.7 in ./venv/lib/python3.10/site-packages (from matplotlib>=3.0.0->mlxtend) (2.8.2)
Requirement already satisfied: kiwisolver>=1.0.1 in ./venv/lib/python3.10/site-packages (from matplotlib>=3.0.0->mlxtend) (1.4.4)
Requirement already satisfied: contourpy>=1.0.1 in ./venv/lib/python3.10/site-packages (from matplotlib>=3.0.0->mlxtend) (1.0.7)
Requirement already satisfied: pyparsing>=2.2.1 in ./venv/lib/python3.10/site-packages (from matplotlib>=3.0.0->mlxtend) (3.0.9)
Requirement already satisfied: packaging>=20.0 in ./venv/lib/python3.10/site-packages (from matplotlib>=3.0.0->mlxtend) (23.0)
Requirement already satisfied: fonttools>=4.22.0 in ./venv/lib/python3.10/site-packages (from matplotlib>=3.0.0->mlxtend) (4.38.0)
Requirement already satisfied: pytz>=2020.1 in ./venv/lib/python3.10/site-packages (from pandas>=0.24.2->mlxtend) (2022.7.1)
Requirement already satisfied: threadpoolctl>=2.0.0 in ./venv/lib/python3.10/site-packages (from scikit-learn>=1.0.2->mlxtend) (3.1.0)
Requirement already satisfied: six>=1.5 in ./venv/lib/python3.10/site-packages (from python-dateutil>=2.7->matplotlib>=3.0.0->mlxtend) (1.16.0)

Out[4]:
InvoiceNo StockCode Description Quantity InvoiceDate UnitPrice CustomerID Country
0 536365 85123A WHITE HANGING HEART T-LIGHT HOLDER 6 12.1.10 8:26 2.55 17850.0 United Kingdom
1 536365 71053 WHITE METAL LANTERN 6 12.1.10 8:26 3.39 17850.0 United Kingdom
2 536365 84406B CREAM CUPID HEARTS COAT HANGER 8 12.1.10 8:26 2.75 17850.0 United Kingdom
3 536365 84029G KNITTED UNION FLAG HOT WATER BOTTLE 6 12.1.10 8:26 3.39 17850.0 United Kingdom
4 536365 84029E RED WOOLLY HOTTIE WHITE HEART. 6 12.1.10 8:26 3.39 17850.0 United Kingdom
... ... ... ... ... ... ... ... ...
541904 581587 22613 PACK OF 20 SPACEBOY NAPKINS 12 12.9.11 12:49 0.85 12680.0 France
541905 581587 22899 CHILDREN'S APRON DOLLY GIRL 6 12.9.11 12:49 2.10 12680.0 France
541906 581587 23254 CHILDRENS CUTLERY DOLLY GIRL 4 12.9.11 12:49 4.15 12680.0 France
541907 581587 23255 CHILDRENS CUTLERY CIRCUS PARADE 4 12.9.11 12:49 4.15 12680.0 France
541908 581587 22138 BAKING SET 9 PIECE RETROSPOT 3 12.9.11 12:49 4.95 12680.0 France

541909 rows × 8 columns

Out[5]:
InvoiceNo StockCode Description Quantity InvoiceDate UnitPrice CustomerID Country
0 536365 85123A WHITE HANGING HEART T-LIGHT HOLDER 6 12.1.10 8:26 2.55 17850.0 United Kingdom
1 536365 71053 WHITE METAL LANTERN 6 12.1.10 8:26 3.39 17850.0 United Kingdom
2 536365 84406B CREAM CUPID HEARTS COAT HANGER 8 12.1.10 8:26 2.75 17850.0 United Kingdom
3 536365 84029G KNITTED UNION FLAG HOT WATER BOTTLE 6 12.1.10 8:26 3.39 17850.0 United Kingdom
4 536365 84029E RED WOOLLY HOTTIE WHITE HEART. 6 12.1.10 8:26 3.39 17850.0 United Kingdom
Out[6]:
            Quantity      UnitPrice     CustomerID
count  541909.000000  541909.000000  406829.000000
mean        9.552250       4.611114   15287.690570
std       218.081158      96.759853    1713.600303
min    -80995.000000  -11062.060000   12346.000000
25%         1.000000       1.250000   13953.000000
50%         3.000000       2.080000   15152.000000
75%        10.000000       4.130000   16791.000000
max     80995.000000   38970.000000   18287.000000
Negative quantity and negative price in the same record: No

Records where quantity is negative and price is 0, or vice versa: 1336

Customer IDs of the records above: [nan]

% Negative Quantity: 1.96%

All records with negative quantity have an InvoiceNo starting with: ['C']

Example of a negative quantity and other related records:
InvoiceNo StockCode Description Quantity InvoiceDate UnitPrice CustomerID Country
2 536365 84406B CREAM CUPID HEARTS COAT HANGER 8 12.1.10 8:26 2.75 17850.0 United Kingdom
51 536373 84406B CREAM CUPID HEARTS COAT HANGER 8 12.1.10 9:01 2.75 17850.0 United Kingdom
68 536375 84406B CREAM CUPID HEARTS COAT HANGER 8 12.1.10 9:31 2.75 17850.0 United Kingdom
280 536396 84406B CREAM CUPID HEARTS COAT HANGER 8 12.1.10 10:50 2.75 17850.0 United Kingdom
418 536406 84406B CREAM CUPID HEARTS COAT HANGER 8 12.1.10 11:32 2.75 17850.0 United Kingdom
3194 536609 84406B CREAM CUPID HEARTS COAT HANGER 6 12.2.10 9:40 2.75 17850.0 United Kingdom
3226 536612 84406B CREAM CUPID HEARTS COAT HANGER 6 12.2.10 9:43 2.75 17850.0 United Kingdom
3427 536628 84406B CREAM CUPID HEARTS COAT HANGER 6 12.2.10 10:54 2.75 17850.0 United Kingdom
3445 536630 84406B CREAM CUPID HEARTS COAT HANGER 6 12.2.10 10:56 2.75 17850.0 United Kingdom
3810 536685 84406B CREAM CUPID HEARTS COAT HANGER 6 12.2.10 12:23 2.75 17850.0 United Kingdom
3825 536690 84406B CREAM CUPID HEARTS COAT HANGER 6 12.2.10 12:24 2.75 17850.0 United Kingdom
4163 536750 84406B CREAM CUPID HEARTS COAT HANGER 6 12.2.10 14:03 2.75 17850.0 United Kingdom
4184 536752 84406B CREAM CUPID HEARTS COAT HANGER 6 12.2.10 14:06 2.75 17850.0 United Kingdom
4512 536787 84406B CREAM CUPID HEARTS COAT HANGER 8 12.2.10 15:24 2.75 17850.0 United Kingdom
4528 536790 84406B CREAM CUPID HEARTS COAT HANGER 12 12.2.10 15:26 2.75 17850.0 United Kingdom
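The checks printed above could be reproduced along the following lines. This is a minimal sketch on toy rows (not taken from the dataset); the variable names are illustrative assumptions, since the notebook's own code is not part of this output.

```python
import pandas as pd

# Toy rows for illustration only; they are not records from the dataset.
df = pd.DataFrame({
    "InvoiceNo": ["536365", "C536379", "536380", "C536383"],
    "Quantity": [6, -1, 10, -2],
    "UnitPrice": [2.55, 27.50, 0.0, 4.65],
})

negative_qty = df["Quantity"] < 0

# Is there any record where quantity and price are both negative?
both_negative = bool((negative_qty & (df["UnitPrice"] < 0)).any())

# Share of records with negative quantity, as a percentage.
pct_negative = negative_qty.mean() * 100

# First character of InvoiceNo for every negative-quantity record.
prefixes = df.loc[negative_qty, "InvoiceNo"].str[0].unique().tolist()

print("Both negative in the same record:", "Yes" if both_negative else "No")
print(f"% Negative Quantity: {pct_negative:.2f}%")
print("InvoiceNo prefixes for negative quantity:", prefixes)
```

Taking the mean of a boolean Series gives the fraction of True values, which avoids a separate count and division.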
Out[10]:
InvoiceNo              object
StockCode              object
Description            object
Quantity                int64
InvoiceDate    datetime64[ns]
UnitPrice             float64
CustomerID              Int64
Country                object
amount                float64
Internal               object
dtype: object
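One way to arrive at dtypes like these is sketched below. The date format is inferred by comparing `12.1.10 8:26` in the raw table with `2010-12-01 08:26:00` in the later one; defining `amount` as `Quantity * UnitPrice` is consistent with the values shown (6 x 2.55 = 15.30). The `Internal` column is left out because this output does not show how it is defined. The rows are toy data.

```python
import pandas as pd

# Toy rows shaped like the head() output; the second CustomerID is
# deliberately missing to show the nullable integer dtype.
raw = pd.DataFrame({
    "Quantity": [6, 6],
    "InvoiceDate": ["12.1.10 8:26", "12.9.11 12:49"],
    "UnitPrice": [2.55, 3.39],
    "CustomerID": [17850.0, None],
})

df = raw.copy()

# Assumed format: month.day.two-digit-year hour:minute.
df["InvoiceDate"] = pd.to_datetime(df["InvoiceDate"], format="%m.%d.%y %H:%M")

# "Int64" (capital I) is the nullable integer dtype, so missing IDs stay missing.
df["CustomerID"] = df["CustomerID"].astype("Int64")

# Line total per record.
df["amount"] = df["Quantity"] * df["UnitPrice"]

print(df.dtypes)
```

A plain `int64` cast would fail on the missing customer IDs, which is presumably why the nullable `Int64` dtype appears in the listing above.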
Out[11]:
Description total_count
3918 WHITE HANGING HEART T-LIGHT HOLDER 2369
2915 REGENCY CAKESTAND 3 TIER 2200
1866 JUMBO BAG RED RETROSPOT 2159
2471 PARTY BUNTING 1727
2054 LUNCH BAG RED RETROSPOT 1638
... ... ...
1433 FRYING PAN RED POLKADOT 1
3048 S/4 BLACK DISCO PARTITION PANEL 1
3065 SCALLOP SHELL SOAP DISH 1
1380 FOUND 1
4222 wrongly sold sets 1

4223 rows × 2 columns

/home/ayush/Documents/Project/ayush/customer_segm_RFM/venv/lib/python3.10/site-packages/plotly/io/_renderers.py:395: DeprecationWarning:

distutils Version classes are deprecated. Use packaging.version instead.

Out[13]:
InvoiceNo StockCode Description Quantity InvoiceDate UnitPrice CustomerID Country amount Internal
0 536365 85123A WHITE HANGING HEART T-LIGHT HOLDER 6 2010-12-01 08:26:00 2.55 17850 United Kingdom 15.30 Yes
1 536365 71053 WHITE METAL LANTERN 6 2010-12-01 08:26:00 3.39 17850 United Kingdom 20.34 Yes
2 536365 84406B CREAM CUPID HEARTS COAT HANGER 8 2010-12-01 08:26:00 2.75 17850 United Kingdom 22.00 Yes
3 536365 84029G KNITTED UNION FLAG HOT WATER BOTTLE 6 2010-12-01 08:26:00 3.39 17850 United Kingdom 20.34 Yes
4 536365 84029E RED WOOLLY HOTTIE WHITE HEART. 6 2010-12-01 08:26:00 3.39 17850 United Kingdom 20.34 Yes
Total rows with negative quantity = 10624
Total rows with quantity of at least 1 = 531285
Total rows used for further analysis = 531285
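The split reported above (10624 + 531285 = 541909 rows) can be sketched as a simple boolean filter. The rows below are toy data and the frame names are assumptions.

```python
import pandas as pd

# Toy rows for illustration; the real frame has 541909 rows.
df = pd.DataFrame({
    "InvoiceNo": ["536365", "C536379", "536381", "536382"],
    "Quantity": [6, -1, 0, 3],
})

negative = df[df["Quantity"] < 0]
# Keep only rows with a quantity of at least 1 for the later analysis.
kept = df[df["Quantity"] >= 1].copy()

print("Total rows with negative quantity =", len(negative))
print("Total rows with quantity of at least 1 =", len(kept))
```

The `.copy()` keeps later column assignments on `kept` from triggering chained-assignment warnings.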
Out[16]:
Description amount
1072 DOTCOM POSTAGE 206248.77
2867 REGENCY CAKESTAND 3 TIER 174484.74
2401 PAPER CRAFT , LITTLE BIRDIE 168469.60
3860 WHITE HANGING HEART T-LIGHT HOLDER 106292.77
2427 PARTY BUNTING 99504.33
1827 JUMBO BAG RED RETROSPOT 94340.05
2065 MEDIUM CERAMIC TOP STORAGE JAR 81700.92
2205 Manual 78112.82
2706 POSTAGE 78101.88
2755 RABBIT NIGHT LIGHT 66964.99
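A ranking like the one above can be produced by summing `amount` per description and sorting. The sketch below uses toy values, not figures from the dataset. Note that entries such as DOTCOM POSTAGE, Manual, and POSTAGE in the table above look like fees or adjustments rather than products, so they may need to be excluded before interpreting the ranking.

```python
import pandas as pd

# Toy line items; amounts are made up for illustration.
sales = pd.DataFrame({
    "Description": ["POSTAGE", "PARTY BUNTING", "POSTAGE",
                    "PARTY BUNTING", "RABBIT NIGHT LIGHT"],
    "amount": [18.0, 4.95, 36.0, 9.90, 2.08],
})

top = (
    sales.groupby("Description", as_index=False)["amount"]
    .sum()
    .sort_values("amount", ascending=False)
    .head(10)
)
print(top)
```

Sorting after the groupby keeps the pre-sort row labels, which is why the index column in the table above is not sequential.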
Out[23]:
InvoiceNo 0
0 536365 7
1 536366 2
2 536367 12
3 536368 4
4 536369 1
... ... ...
20723 581586 4
20724 581587 15
20725 A563185 1
20726 A563186 1
20727 A563187 1

20728 rows × 2 columns
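The column header `0` in the table above is what pandas produces when an unnamed Series is turned back into a frame with `reset_index()`. The sketch below, on toy rows, shows that behaviour and how a name can be supplied instead; `n_lines` is a hypothetical column name, and whether the original counts lines or distinct items is not visible from this output.

```python
import pandas as pd

# Toy invoice lines for illustration.
lines = pd.DataFrame({
    "InvoiceNo": ["536365"] * 3 + ["536366"] * 2 + ["A563185"],
})

# Without a name, the count column is labelled 0.
default_name = lines.groupby("InvoiceNo").size().reset_index()

# Passing name= gives the count column a readable header.
named = lines.groupby("InvoiceNo").size().reset_index(name="n_lines")

print(default_name.columns.tolist())
print(named)
```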

Out[24]:
Description 4 PURPLE FLOCK DINNER CANDLES 50'S CHRISTMAS GIFT BAG LARGE DOLLY GIRL BEAKER I LOVE LONDON MINI BACKPACK I LOVE LONDON MINI RUCKSACK NINE DRAWER OFFICE TIDY OVAL WALL MIRROR DIAMANTE RED SPOT GIFT BAG LARGE SET 2 TEA TOWELS I LOVE LONDON SPACEBOY BABY GIFT SET ... returned taig adjust test to push order througha s stock was website fixed wrongly coded 20713 wrongly coded 23343 wrongly marked wrongly marked 23343 wrongly sold (22719) barcode
InvoiceNo
536365 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 ... 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
536366 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 ... 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
536367 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 ... 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
536368 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 ... 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
536369 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 ... 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0

5 rows × 4077 columns

Out[25]:
Description 4 PURPLE FLOCK DINNER CANDLES 50'S CHRISTMAS GIFT BAG LARGE DOLLY GIRL BEAKER I LOVE LONDON MINI BACKPACK I LOVE LONDON MINI RUCKSACK NINE DRAWER OFFICE TIDY OVAL WALL MIRROR DIAMANTE RED SPOT GIFT BAG LARGE SET 2 TEA TOWELS I LOVE LONDON SPACEBOY BABY GIFT SET ... returned taig adjust test to push order througha s stock was website fixed wrongly coded 20713 wrongly coded 23343 wrongly marked wrongly marked 23343 wrongly sold (22719) barcode
InvoiceNo
536365 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
536366 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
536367 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
536368 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
536369 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0

5 rows × 4077 columns
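The two tables above look like an invoice-by-item quantity matrix followed by an encoded version of it. A minimal sketch of that pattern on toy data follows; the item names and frame names are assumptions. Encoding to `bool` rather than 0/1 integers is a deliberate choice here, since a boolean matrix is what the mlxtend warning further down asks for.

```python
import pandas as pd

# Toy invoice lines; item names are placeholders.
lines = pd.DataFrame({
    "InvoiceNo": ["536365", "536365", "536366", "536367", "536367"],
    "Description": ["ITEM A", "ITEM B", "ITEM A", "ITEM B", "ITEM C"],
    "Quantity": [6, 8, 2, 4, 1],
})

# One row per invoice, one column per item, quantities as values.
basket_qty = (
    lines.groupby(["InvoiceNo", "Description"])["Quantity"]
    .sum()
    .unstack()
    .fillna(0)
)

# True if the item appears in the basket at all.
basket = basket_qty > 0

# Support of a single item = share of baskets that contain it.
item_support = basket.mean()

print(basket)
print(item_support)
```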

Total number of baskets used for analysis = 20136
The minimum support value for the analysis is 1.4899%
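The reported threshold is consistent with requiring an itemset to appear in about 300 of the 20136 baskets. The figure 300 is an inference from the two printed numbers, not something this output states, so treat it as an assumption.

```python
n_baskets = 20136   # printed in the output above
min_count = 300     # assumption: inferred from the printed percentage

# Support is the fraction of baskets that contain an itemset.
min_support = min_count / n_baskets

print(f"Minimum support: {min_support * 100:.4f} %")
```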
/home/ayush/Documents/Project/ayush/customer_segm_RFM/venv/lib/python3.10/site-packages/mlxtend/frequent_patterns/fpcommon.py:111: DeprecationWarning:

DataFrames with non-bool types result in worse computational performance and their support might be discontinued in the future. Please use a DataFrame with bool type

Out[27]:
support
count 711.000000
mean 0.024390
std 0.011732
min 0.014899
25% 0.017084
50% 0.020461
75% 0.026793
max 0.112237
Out[28]:
antecedents consequents antecedent support consequent support support confidence lift leverage conviction
476 (REGENCY CAKESTAND 3 TIER, PINK REGENCY TEACUP... (GREEN REGENCY TEACUP AND SAUCER) 0.016438 0.050407 0.014948 0.909366 18.040379 0.014120 10.477174
490 (PINK REGENCY TEACUP AND SAUCER, ROSES REGENCY... (GREEN REGENCY TEACUP AND SAUCER) 0.029748 0.050407 0.026917 0.904841 17.950627 0.025417 9.979054
478 (REGENCY CAKESTAND 3 TIER, PINK REGENCY TEACUP... (ROSES REGENCY TEACUP AND SAUCER ) 0.017034 0.052940 0.014948 0.877551 16.576330 0.014047 7.734323
464 (REGENCY CAKESTAND 3 TIER, PINK REGENCY TEACUP... (GREEN REGENCY TEACUP AND SAUCER) 0.019468 0.050407 0.017034 0.875000 17.358621 0.016053 7.596742
120 (CHARLOTTE BAG PINK POLKADOT, STRAWBERRY CHARL... (RED RETROSPOT CHARLOTTE BAG) 0.018822 0.051351 0.016190 0.860158 16.750626 0.015223 6.783737
492 (PINK REGENCY TEACUP AND SAUCER, GREEN REGENCY... (ROSES REGENCY TEACUP AND SAUCER ) 0.031436 0.052940 0.026917 0.856240 16.173782 0.025253 6.587791
470 (REGENCY CAKESTAND 3 TIER, PINK REGENCY TEACUP... (ROSES REGENCY TEACUP AND SAUCER ) 0.019468 0.052940 0.016438 0.844388 15.949899 0.015408 6.086025
592 (REGENCY TEA PLATE GREEN ) (REGENCY TEA PLATE ROSES ) 0.018921 0.022100 0.015842 0.837270 37.886013 0.015424 6.009355
458 (PINK REGENCY TEACUP AND SAUCER) (GREEN REGENCY TEACUP AND SAUCER) 0.038041 0.050407 0.031436 0.826371 16.393893 0.029519 5.469083
416 (CHARLOTTE BAG PINK POLKADOT, WOODLAND CHARLOT... (RED RETROSPOT CHARLOTTE BAG) 0.019617 0.051351 0.016140 0.822785 16.022819 0.015133 5.353092
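The metric columns in the table above follow from the three support values by the standard association-rule definitions. As a check, the sketch below recomputes them for rule 592 (REGENCY TEA PLATE GREEN -> REGENCY TEA PLATE ROSES) from the supports printed in that row; small differences in the last digits are expected because the inputs are already rounded.

```python
# Values copied from rule 592 in the table above.
antecedent_support = 0.018921
consequent_support = 0.022100
support = 0.015842  # support of antecedent and consequent together

# P(consequent | antecedent)
confidence = support / antecedent_support

# How much more often the pair occurs than if the items were independent.
lift = confidence / consequent_support

# Difference between observed and expected joint support.
leverage = support - antecedent_support * consequent_support

# Ratio of expected to observed frequency of the rule being wrong.
conviction = (1 - consequent_support) / (1 - confidence)

print(f"confidence={confidence:.6f} lift={lift:.4f} "
      f"leverage={leverage:.6f} conviction={conviction:.4f}")
```

A lift far above 1, as here, indicates the two tea plates are bought together much more often than chance would suggest.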
Out[33]:
antecedents consequents antecedent support consequent support support confidence lift leverage conviction
476 (REGENCY CAKESTAND 3 TIER, PINK REGENCY TEACUP... (GREEN REGENCY TEACUP AND SAUCER) 0.016438 0.050407 0.014948 0.909366 18.040379 0.014120 10.477174
490 (PINK REGENCY TEACUP AND SAUCER, ROSES REGENCY... (GREEN REGENCY TEACUP AND SAUCER) 0.029748 0.050407 0.026917 0.904841 17.950627 0.025417 9.979054
478 (REGENCY CAKESTAND 3 TIER, PINK REGENCY TEACUP... (ROSES REGENCY TEACUP AND SAUCER ) 0.017034 0.052940 0.014948 0.877551 16.576330 0.014047 7.734323
464 (REGENCY CAKESTAND 3 TIER, PINK REGENCY TEACUP... (GREEN REGENCY TEACUP AND SAUCER) 0.019468 0.050407 0.017034 0.875000 17.358621 0.016053 7.596742
120 (CHARLOTTE BAG PINK POLKADOT, STRAWBERRY CHARL... (RED RETROSPOT CHARLOTTE BAG) 0.018822 0.051351 0.016190 0.860158 16.750626 0.015223 6.783737
... ... ... ... ... ... ... ... ... ...
485 (PINK REGENCY TEACUP AND SAUCER, GREEN REGENCY... (REGENCY CAKESTAND 3 TIER, ROSES REGENCY TEACU... 0.031436 0.026073 0.014948 0.475513 18.237978 0.014129 1.856916
439 (DOTCOM POSTAGE) (SUKI SHOULDER BAG) 0.035161 0.026321 0.016687 0.474576 18.030317 0.015761 1.853131
407 (WOODLAND CHARLOTTE BAG) (CHARLOTTE BAG PINK POLKADOT) 0.041468 0.036899 0.019617 0.473054 12.820206 0.018086 1.827703
109 (CHARLOTTE BAG SUKI DESIGN) (STRAWBERRY CHARLOTTE BAG) 0.043802 0.035856 0.020610 0.470522 13.122468 0.019039 1.820931
107 (RED RETROSPOT CHARLOTTE BAG) (STRAWBERRY CHARLOTTE BAG) 0.051351 0.035856 0.024136 0.470019 13.108462 0.022295 1.819206

141 rows × 9 columns